Using character overlap to improve language transformation
نویسندگان
چکیده
Language transformation can be defined as translating between diachronically distinct language variants. We investigate the transformation of Middle Dutch into Modern Dutch by means of machine translation. We demonstrate that by using character overlap the performance of the machine translation process can be improved for this task.
منابع مشابه
A Linguistic Account of the Protagonist’s Development in the Grapes of Wrath
The novel as a modern literary genre is generally regarded as the realization of its main character's journey from immaturity to a status of maturity. The character, usually an uncomplicated person unable to cope with the complexities of life at first, gains an insight and understanding to handle his/her complex situation accordingly later in the novel. It is usually agreed in both literary cri...
متن کاملA new model for persian multi-part words edition based on statistical machine translation
Multi-part words in English language are hyphenated and hyphen is used to separate different parts. Persian language consists of multi-part words as well. Based on Persian morphology, half-space character is needed to separate parts of multi-part words where in many cases people incorrectly use space character instead of half-space character. This common incorrectly use of space leads to some s...
متن کاملPygmalion in Conversation with Pierre Bourdieu:A Sociological Perspective
George Bernard Shaw's masterpiece Pygmalion deals with the social function of language and reveals that Linguistic Competence is one of the markers of social status. It presents the story of the social transformation of a flower girl into a ‘lady’ through linguistic retraining. This work has been analyzed from a variety of perspectives such as Freudian psychology and sociolinguistic perspective...
متن کاملطراحی و ساخت کانسترکت نوترکیب واجد ژن اینترفرون بتای جهش یافته در ناحیه کزاک (Kozak) به منظور تشدید ترجمه
Background: Interferon beta is one of the most important members of group I interferons and is the main drug for multiple sclerosis treatment. Interferon beta has short half life and this compels patients to make frequent use of medicine. According to its clinical usage there is broad effort to improve translation level and protein production. There are several important factors which effect pr...
متن کاملTwitter Paraphrase Identification with Simple Overlap Features and SVMs
We present an approach to identifying Twitter paraphrases using simple lexical overlap features. The work is part of ongoing research into the applicability of knowledgelean techniques to paraphrase identification. We utilize features based on overlap of word and character n-grams and train support vector machine (SVM). Our results demonstrate that character and word level overlap features in c...
متن کامل